Collaborative training of heterogeneous reinforcement learning agents in environments with sparse rewards: what and when to share?
Abstract
In the early stages of human life, babies develop their skills by exploring different scenarios, motivated by their inherent satisfaction rather than by extrinsic rewards from the environment. This behavior, referred to as intrinsic motivation, has emerged as one solution to address the exploration challenge derived from reinforcement learning environments with sparse rewards. Diverse exploration approaches have been proposed to accelerate the learning process over single- and multi-agent problems with homogeneous agents. However, scarce studies have elaborated on collaborative learning frameworks between heterogeneous agents deployed into the same environment, but interacting with different instances of the latter without any prior knowledge. Beyond the heterogeneity, each agent's characteristics grant access only to a subset of the full state space, which may hide different exploration strategies and optimal solutions. In this work we combine ideas from intrinsic motivation and transfer learning. Specifically, we focus on sharing parameters in actor-critic model architectures and on combining information obtained through intrinsic motivation, with the aim of having a more efficient exploration and faster learning. We test our strategies through experiments performed over a modified ViZDooM's My Way Home scenario, which is more challenging than its original version and allows evaluating the heterogeneity between agents. Our results reveal different ways in which a collaborative framework with little additional computational cost can outperform an independent learning process without knowledge sharing. Additionally, we depict the need for modulating correctly the importance between the extrinsic and intrinsic rewards to avoid undesired agent behaviors.
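The balance between extrinsic and intrinsic rewards mentioned at the end of the abstract can be illustrated with a minimal sketch. It assumes a simple weighted sum, which is a common way to combine the two signals but is not confirmed here as the authors' formulation; the function name `combined_reward` and the coefficient `beta` are hypothetical.

```python
def combined_reward(extrinsic: float, intrinsic: float, beta: float = 0.01) -> float:
    """Return the training reward as extrinsic + beta * intrinsic.

    `beta` is a hypothetical weighting coefficient (an assumption, not taken
    from the paper). A larger beta gives the exploration bonus more weight
    relative to the environment's own reward.
    """
    return extrinsic + beta * intrinsic


if __name__ == "__main__":
    # Sparse-reward step: the environment gives nothing, so only the
    # scaled intrinsic bonus contributes to the training signal.
    print(combined_reward(extrinsic=0.0, intrinsic=4.0, beta=0.25))
    # Goal step: the extrinsic reward is present alongside the bonus.
    print(combined_reward(extrinsic=1.0, intrinsic=2.0, beta=0.5))
```

Under this assumed form, setting `beta` too high could let the intrinsic bonus outweigh the rare extrinsic reward, which is one way the undesired behaviors noted in the abstract might arise.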
Similar resources
The Collaborative Learning in the e-Learning Environments
Introduction: The collaborative learning and interactive electronic-learning (e-learning) is one of the key factors in education system success. This study examined the collaborative e-learning in the framework of constructivism theory. Methods: This is a review article. The databases such as Scientific Information Databases, Magiran, Science Direct, and Google Scholar were reviewed. Also,...
Designing collaborative learning model in online learning environments
Introduction: Most online learning environments are challenging for the design of collaborative learning activities to achieve high-level learning skills. Therefore, the purpose of this study was to design and validate a model for collaborative learning in online learning environments. Methods: The research method used in this study was a mixed method, including qualitative content analysis and...
Imitation and Reinforcement Learning in Agents with Heterogeneous Actions
We study the problem of accelerating reinforcement learning (RL) through the observation and implicit imitation of expert agents (mentors) acting in the same domain. In this paper, we consider problems that arise when the learner and mentor have heterogeneous actions. We extend an earlier implicit imitation model to allow for feasibility testing (determining whether a specific mentor action can...
A comparative study of language learning strategies employed by bilinguals and monolinguals with reference to attitudes and motivation
The aim of this research is to examine certain cognitive and affective factors, namely the use of language learning strategies, motivation, and attitudes toward English, in relation to the learners' linguistic background. The goal was to determine whether there is a significant difference between bilingual and monolingual learners in their use of language learning strategies, motivation, attitudes, and level of language proficiency. The study also looked at the best and most effective predictive factors ...
Reinforcement Learning Without Rewards
Machine learning can be broadly defined as the study and design of algorithms that improve with experience. Reinforcement learning is a variety of machine learning that makes minimal assumptions about the information available for learning, and, in a sense, defines the problem of learning in the broadest possible terms. Reinforcement learning algorithms are usually applied to “interactive” prob...
Journal
Journal title: Neural Computing and Applications
Year: 2022
ISSN: 0941-0643, 1433-3058
DOI: https://doi.org/10.1007/s00521-022-07774-5